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HDDELING ERROR ANALYSIS OF STATIONARY 


LINEAR DISCRETE-TIME FILTERS 

RAJNIKANT PATEL* AND MITSUHIIM) TODA* 

Ames Research Center, NASA, Moffett Field, Calif. 94035 

Abstract, The performance of Kalman-type, linear, discrete- time 
filters in the presence of modeling errors is considered. The discus- 
sion is limited to stationary performance, and bounds are obtained for 
the performance index, the mean-squared error of estimates for suboptimal 
and optimal (Kalnmn) filters. The computation of these bounds requires 
information on only the model matrices and the range of errors for these 
matrices. Consequently, a designer can easily compare the performance 
of a suboptimal filter with that of the optimal filter, when only the 
range of errors in the elements of the model matrices is available. 

1. Introduction. One of the problems arising in the application 
of the minimum variance optimal filter of Kalman eind Bucy [8] is that a 
design based on imperfect knowledge of the system configuration and noise 
statistics often results in poor performance. Thus, there has been con- 
siderable research on the effect of modeling errors on filter performance 
[3,5,6,10,11]. In particular, errors in prior information on state 
statistics and noise covariances [6,10,11] and in system models [3,5] 
have been considered. 

This paper is concerned with providing boun'^s on the performance for 
suboptimal as well as optimal discrete-time filters based on information 

*NRC Postdoctoral Research Associate. 


about the range of modeling errors. These results are useful from the 
practical point of view, as a designer often has Information on the range 
of modeling errors rather than a precise knowledge of the modeling 
errors. In this paper we limit the discussion to stationary conditions 
and obtain performance bounds for discrete-time filters for two types of 
errors; noise covariance errors and system configuration errors. Such 
bounds were obtained for continuous-time filters in an earlier paper [12]. 
The results reported here and in [12] are quite different from those of 
earlier work In that we do not limit the discussion to small-scale 
(differential) analysis as In [5], nor do we assume sign definiteness of 
the covariance errors as in [11]. 

This paper is organized as follows: the problem Is formulated in 

Sec. 2; in Sec. 3, two general bounds are obtained. These bounds are 
analyzed in Sec. A to obtain practical expressions when the system configu- 
ration is assumed to be known and, in Sec. 5 when the noise covariances 
are assumed to be known. The results In Sec. A and 5 are Illustrated 
by an example In Sec. 6. S°.ctlon 7 concludes with some remarks on the 
results and comments about future research. 

2. Problem Statement. Consider a time- invar! ant process described 
by 

x(k + 1) = Ax(k) + Gw(k) (1) 

where the n- dimensional state vector x(k + 1) is measured by an 
m-dimensional vector y(k + 1): 

y(k + 1) = Hx(k + 1) + v(k + 1) ' (2) 
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The i-dlmenslonal process noise w(k) and the s^d Irons ional roasure- 
ront noise v(k + 1) are assurod to be mutually Independent Gaussian 
noises with zero roan and 

E[w(k)w(j)'*'] - » Q ^ 0+ (3) 

E[v(k)v(j)'*^] - j , R > 0+ (4) 

where denotes the Kronecker delta. The system matrices A, G, 

and H are assurod to have appropriate dimensions , and A is assumed to 
be a convergent matrix. 

The optimal estimates x(k + ilk + 1) that minimizes 
J(k + 1) = E [ll x(k + ilk + 1) - x(k + 1) IP] ^ 
with observations [y(0), y(l), ...» y(k + 1)], are given by [8]: 

x(k + l|k + 1) = Ax(k|k) + K(k + l)[y(k + 1) - HAx(k|k)] (5) 

where K(k + 1) is the Kalman gain matrix specified by the recursive 
relations : 

K(k + 1) = Po(k + llk)H'^[HPo(k + l|k)H’^ + R]"l (6) 

Po(k + ijk) = APo(k|k)A^ + GQgT (7) 

and 

Po(k + ilk + 1) » [I„ - K(k + l)H]Po(k + ilk) (8) 

with x(0) and Pq( 0) provided by prior Infirmatlon on x(0) and I^j, 
the n n Identity matrix. The minimized index J(k + 1) is given 
by 


■^■A symmetric matrix W is denoted as W :• 0 (W > 0) when W is 
positive definite (semideflnlte) . Also, W > Z (W > Z) denotes W - Z > 0 


(W - Z > 0). 



tor w (matrix W) . 


denotes the Euclidean norm of a vec- 
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Min J(k + 1) - tr[PQ(k + ilk + 1)] (9) 

where tr(W) denotes the trace of a square matrix W. 

We assume that complete Information on the system matrices A and H 
«ind the noise covariances Q and R is not available, and that the opti- 
mal estimator is replaced by a suboptlmal estimator based on a model: 
Xj^(k + ilk + 1) = AjjXj^(k|k) + Kj^(k + l)[y(k + 1) - Hj^Aj^XjjCk |k) ] (10) 

where = A -H AA and = H + AH are model representations of A 

and H, respectively, and AA and AH denote modeling errors. The gain 
RnCk + 1) can be computed from the model matrices using the following 
relations : 

Kjj(k + 1) = Pjj(k + l|k)H^'^[HjjP^(k + l|k)Hjj‘^ + (11) 

P^(k + ilk) = Aj^jj(klk)Ajj'^ + (12) 

atid 

Pjj(k + ilk + 1) = [ 1 ^- Kjj(k + l)Hjj]Pjj(k + ilk) (13) 

where Qm = Q + AQ and = R + AR are the model representations of 

Q and R, respectively. The mean-squared error of this estimate is 
expressed by 

E pi x^(k + 1) - x(k + 1) II = tr[P(k + 1)] (14) 

In (14) and in the remainder of this section, the index (jlj) is 
denoted by (j) wherever appropriate. The covariance matrix P(k + 1) 
is described by [11] 

P(k + 1) = L(k)A^(k)A^\(k)^ + AC(k)V(k)A^^L(k)^ + L(k)Aj^V(k)'^AC(k)'^ 

+ AC(k)U(k)AC(k)'^ + [L(k) + Kj^(k) AH]GQG’^[L(k) + K^(k)AH]'^ 

+ K^(k;SKj^(k)’^ (15) 
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V(k + 1) - AV(k)A^^L(k)^ + AU(k)AC(k)’^ - bQ^[L(k) + K^(k)AH3 

U(k + 1) - AU(k)A" + GQg'*^ 

where 

L(k) 5 I„ - K^(k)Hj^ 

AC(k) S L(k)AA - Kj^(k)AHAj^ + Kj^(k)AHAA 

P(k) = Eix(k)x(k)^] 

V(k) = E[x(k)i(k)^] 

U(k) = E[x(k)x(k)^] 
x(k) = ^(k) ~ x(k) 

P(0) = -V(0) = U(0) = Ej[x(0) - 5(0)][x(0) - x(0)]’’j 
x(0) H E[x(0)] 

Since the modeling error matrices AA, AH» AQ, and AR are generally 
not known exactly, (15) to (17) cannot be solved to obtain tr[P(k + 1) ] , 
the performance of the suboptlmal filter. However, a designer usually 
has estimates of the magnitude of errors In the model matrices, e.g., 
[AAj^jj in the (i,j)th element of Am- Therefore, It Is reasonable to 
obtain upper bounds for tr[P(k +1)] based on such estimates. It Is 
also helpful to obtain lower bounds for tr[?Q(k +1)] as functions of 
the estimates so that the designer can evaluate the performance degrada- 
tion, tr[P(k + 1)] - tr[P^(k +1)], which he should expect with the 
possible modeling errors. Such bounds are obtained In later sections 
for stationary conditions. 

3. Performance Analysis for Stationary Conditions. The covariances 
of estimation errors are constant matrices for stationary conditions, l.e 
in (6) to (8), Po(k + ijk) = Po(klk - 1) and P^Ck + l|k + 1) - Po(k|k) 


(16) 

(17) 

(18) 
(19) 
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and, in (15) to (17), PCk + 1) - P(k), V(k + 1) - V(k), and 
U(k + 1) B U(k) for stationary conditions. Hence the stationary flltei- 
Ing error covariance matrix for the suboptlmal filter Is denoted uy P 
and the stationary filtering and (one-step) prediction error covariances 
for the optimal filter are denoted by Pp and Pp, respectively. In this 
section, general expressions for an upper bound for tr(P) and a lower 
bound for tr(Pp) are obtained. These are specialized In later sections 
to obtain more practical expressions. 

In the sequel , the notation W ® Z is used for the Kronecker 
product of matrices W and Z. The colvimn string of an n x n matrix W, 
dtncted by cs (W) , is defined by the following n^-dlmensional column 
vector; 

T 

C3(W) = • • • '^nl * ^^12 • * • '^n2 * • • •» ^in • • • '^nn^ 

where Is the (j,k)th elen^nt of matrix W. Note that 

tr(W) - [cs(Ijj)]'‘^cs(W) (20) 

and 

l|cs(W)|l - l!w|| (21) 

Theorem 1 : An upper bound for tr(P) is given by 

tr(P) < Xj = iV^b + l|(irl)^i|| II Dll (22) 

( 

where 

*■ = - ‘Sfti <24) 

1 = csdjj) 

b i cs(B), B i LGQjjGV + (25) 
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Remark ; Note that in the expression for only D contains the 
error matrices AA, AH, AQ, and AR. In Secs. 4 and 5, the term || D || 
is analyzed further to obtain more explicit expressions. 

Proof of Theorem 1 ; Equation (15) can be written as 

P - + B + D 

and, in Kronecker form, as 

cs(P) = [(hAjj) ® (LAjj)]cs(P) + cs(B) + cs(D) 

Recalling (20) , we obtain 

tr(P) = l^cs(P) = iV^b + iV^cs(D) 

< iVlb + ||(M-l)'^l|| ||cs(D)[| 

where the Schwartz inequality was used to obtain the ineqiiallty. Noting 
(21) , we obtain (22) , thereby completing the proof. 

Next, we obtain a lower bound for the filtering error covariance 
matrix of the optimal filter. 

Theorem 2 ; A lower bound for tr(Pj.) is given by 

-a^ + + 4ajag 

tr(Pp) > TT 2 = i (28) 

where 

E ||aVr-1ha(| (29) 

® II W II ^ H max[X(WW^) ] denotes the spectral norm (maximum singu- 
lar value) of a matrix W; max X(W)[min X(W)] denotes the maximum 
[minimum] eigenvalue of a symmetric matrix W. 
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.1 


T 

H2 s n + tr(HVlHGQG ) - (30) 

1*1 

and 

33 = tr(GQG^) (31) 

Proof of Theorem 2 ; From (6) to (8) , we obtain 

+ hV1h(APjA'^ + GQg'^) - Ppl(APpA'^ + GQG^) 

Taking the trace of both sides yields 

n + trEMVlHCAPpA*^ + GQg"^) ] - tr(P"^APpA^) + tr(P"^GQG^) 

Applying Lemmas 1 and 2 (see Appendix), we obtain the following 
Inequality: 

n T 

n + II aVr“ 1 ha|| ^tr(P ) + tr(HV^HGQc’^) > 2 I (A) P + <32) 

which can be rearranged in the form 

aj[tr(Pp)]2 + a2tr(Pp) - Sg > 0 

Solving the above inequality for tr(Pp), we obtain (28), thereby com- 
pleting the proof. 

4. Pei’formance Bounds with Inaorreat Noiso Covopianoes. We now 
consider the case for which A and H are known exactly, i.e., AA = 0 
and AH = 0, and we obtain bounds for tr(P) and tr(Pp) in terms of 
AQ and AR explicitly. 

Theorem 3 t If AA = 0 and AH = 0, then an upper bound for 
tr(P) is given by 

tr(P) < TTg = iVlb + l|(tfl)^il| (11 LG II 2 II AQ II + IlKj^ll^ll AR||) 

(33) 


8 


Proof of Theorem 3: Since AA ■ 0 and AH ■ 0 In (26) , 


II B II - II LG4Qg’'l'' + II 


< |lLG||2||iQ|| + ||Kj,l||||iR|| 


where the last inequality follows from the Inequality || WZ || < Ij W || || Z || 

s 

[2, p. 37]. The result In (33) then follows from (22). 

/ Theorem 4 ; If AA * 0 and AH ■ 0, then a lower bound for 

tr(Pp) Is given by 


tr(Pp) > iT^ 


-a 


2 


+ 


+ 40^03 

2aj 


(34) 


where 


II ha p 

[_8 

min X(Rjj) - ||Ar11 


(35) 


IIhg|P(||q„||,+ lU-jll) 

02 = n + 

min X(Rj^) - I |AR| I 


n 


1-1 


(36) 


and 

r.3 = tr(GQ^G'^) - A|iG|| 2|||AQ(i (37) 


Proof of Theorem 4 ; In (28) , decreases monotonically as a^ 

and a 2 Increase and as decreases. Hence, to obtain the botmd 

we bound a^ and a 2 from above and a^ from below In terms of 

II AQ 11 and i|AR|| . This is done by bounding || a'^h'^R-^HA H and 

s 

tr(H^R"^HGQG^) from above and tr(GQG"^) from below. 

7 7 

An upper bound for || A H R"^HA|| is given by 

s 


I aVr-^haI 


< 

8 ” 



llHAll^ ^ |1 ha|12 

‘ min X(R^) - llARll 


(38) 


9 


where Weyl's Inequality [4, p. 157] Is used to obtain the last Inequality. 
An upper bound for tr(H R ^HGQG ) is given by 
trCHVlHGQc’^) < tr(G'^HVlHG) |j Q || 

s 

i tr(HGcV) ||R-MIJ!Qllg 
IIhgP(||q^II,-^ IUqH) 

^ min X(Rjj) - ||ar|| 

where Lemma 1(1) in the Appendix is used to obtain the first two 

T 

InequallLles. A lower bound for tr(GQG ) is obtained as follows: 
tr(GQG^) - tr[G(Q^ - AQ)g'*’] 

« tr(GQj^'*’) - tr(G’’^GAQ) 
i tr(GQj^'^) - SL II g’^GAP II g 
> tr(GQjjG^) - i||G|| 2|j aqII 

T 

where the first inequality follows since G GAQ is an matrix. 

Using (38) to (40) in (29) to (31) results in (34), thereby completing 
the proof. 

Remark : For AQ > 0 and AR > 0 (respectively, AQ < 0 and AR < 0), 

Nlshimura [11] has given bounds on the matrix P of the form Pjj ^ P 

(respectively, P > P„). These results also hold for the nonstationary 
“W 

case. Note that the results of Theorems 3 and 4 do not require any sign 
definiteness for AQ and AR, although they are limited to the stationary 
case. However, unlike the analysis in [11], the results of the theorems 
do not require that the filter gain be obtained by a Kalman filter 

design ((11) to (13)). 

5. Performanae Bounds with Inoorr^^t Process Configuration. We now 

assume that AQ - 0 and AR - 0, i.e., Q and R, are known exactly. 

Theorems 5 and 6 then give an upper bound for tr(P) and a lower bound 
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for tr(P„), respectively. The analysis used to obtain the upper bound 
for tr(P) Is HKjre Involved than that In Sec. 4 since (15) Is coupled 
with (16) and (17). To derive an upper bound, we first analyze (17) for 
the stationary case to obtain the following. 

Lemma ; If 1^ dlagonallzable, then, for stationary U, 

(41) 

where 

a = max[Re A(Fj^)] < 0 (real part of dominant eigenvalue of F^) 

K = j| T II 11t"^ II (spectral condition number of matrix T) 

3 S 

i similarity transformation matrix to diagonalize 
and U,j is the solution of the Lyapunov equation: 

■'m\ * Vm ' -<In - ‘ V 

Remarks ; (a) It is easy to show that a similarity transformation 

matrix T that* diagonalizes also diagonalizes F^. 

(b) The diagonal Izab 11 Ity condition in the above lemma can be 

removed by the following procedure: when a model matrix A^ is not 

dlagonallzable, we can always find a dlagonallzable matrix A^ such 
that II Aj^ - A^llg is ss small as we wish [4, p.lll]. The lemma can 
then be used for such A„. 

(c) The (Cayley^ transformation in (42) maps the eigenvalues of A^ 
inside the unit circle to the left-half complex plane, Re(X) < 0. 
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Proof of LeBBaa ; For stationary U, (17) can be written as 


D - + AAUM^ - AAUAj^^ - A^^^UAA + GQG*^ (43) 

From (42), 

- (1„ - + F^'') (M) 

T 

Pre- and postmultiplying (43) by (I - F,. ) and (I - F,.) respectively, 

n fi II M 

and substituting for from (44) yields: 

¥^V + * - J (In ~ ^UAa’*’ - AAUAjj’^ - A^^UAA^) (1^^ - ¥^) 

Therefore, U can be expressed by the following Integral [1, p. 239]; 

” • f "m - i / |“f(f„^«ki„ - 

- AAUAA^)(I„ - Fjj)[exp(F^t)]dt 
which yields the norm inequality 

•^o 

+ 2 II AA II II Ajj II ^ II exp(Fjjt) II g II U || ^ dt] 

Applying the Bellman-Gronwall lemma [7, p, 420] yields 

II 0 II 3 i ^ II «H II 3 exp [II I„ - II 2 (li iA II 2 

^aa 

+ 2||AA|| ||Aj^||g)j II exp(F^t) II 2 dt] (45) 

Since T diagonalizes F„, it follows that 

n 

exp (Fj|t) = T exp(At)T~^ 

where A is a diagonal matrix with the eigenvalues of F^^ on its 
diagonal. Therefore, 

1! exp(Fj^t) 11 g < II T II g 11 T"MI g II exp (At) ll g “ k exp (at) 
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Hence 


I* II exp(Fj^t) II ^ dt < f k 2 exp(2at)dt « 

•o Jo 

Substituting the above result Into (45) yields (41). 

We can now derive the following upper bound for tr(P) from 
Theorem 1. 

Theorem 5 : If AQ = 0 and AR = 0, then an upper bound for tr(P) 


Is given by 


tr(p) <115 = 


Pi 2 i/ri|(M-i)'^iit ||u^!|^(i|l||^||aa|| 

IIKmIIs" 

and 

u2siVib+ II (iT1)’’i||[b(||l||^||aa|| + |I || ^ || || ^ || ah || 

+ IlKj^llgll AH|| ||aa|()2+ ||Kj^||2|[gqg'^|| J|ah||2 
+ 2 i|K^I!g|[ LGQG^ II J| AH 11] 

Proof of Theorem 5: Since AQ = 0 and AR = 0 In (26), 


D II < 2 II LGQg'^AH^K^'^ II + II K^HGQgWk^^ || + 2 || ACVA^'l" || 


T T T 

1 *■ A * 


+ ACUAC 


< 2 II LGQG^ II s II ISlII s II II II Si II s II II s II '''* 
+ 2i|AC|l ||V||^||LA„|1^+ |UC||2||U||^ 
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From (27) , 

llacil £ lull J|4A|| + IlK^IIJlA^II JiMlI + ||K„||,||M|| ||AH || 

( 48 ) 


Since 



is a covariance matrix, it is positive semidef inite , and with P > 0, 
Lemma 3 (see Appendix) yields 

I V II , 4 II P II II U II s [tr(P) J 1/2 II u II \n (49) 


Using inequalities (47) to (49) In (22) , we get a quadratic inequality 
in [tr(P)jl/2: 


tr(P) - 2pj[tr(P)]l/2 - i 0 


which yields (46) . 

Theorem 6 ; If AQ = 0 and AR = 0, then a lower bound for tr(P ) 
is given by 


tr(Pp) > TTg 


-Y2 + /y2^ + 4YjY3 
2Yi 


(50) 


where 

lUHtl)2 1lR-Ml3(Kll3+ I|AA|1)2 

Y 2 =n + (||H.^I! 3 + ||ah1|)2||r- 1||^ tr(GQG^) 


and 

Y 3 2 tr(GQG^) 

where denotes the (i,i)th term of AA. 
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Proof of Theorem 6 ; As in the proof of Theorem 4, we bound and 
from above and a^ from below In (28) to obtain TTg . Therefore , 


a, - 1 |aVr-iha||^s I|h||2||r-‘|I J|a||2 

i (II»mII,+ llAH||)Mls-Ml3(l|A„ll3 + I|m||)2 -y, 

8j - n + tr(HV'HOQG’^) - J 

1-1 ^ 

< n + |!hV1h|| tr(GQc’^) - ^ [tr(A)]2 

S u 

- “ + (II “m H s II II) ^ II II c 

- ^ ftr(Ajj) - tr(AA)]2 

where Lemma 1 (1) and Lemma 2 are used to obtain the first Inequality for 


32 • Since 


[tr(Ajj) - tr(AA) ]2 > ^ tr(A^) | - |tr(M)|J 


ltr(AA)| < 

1-1 

it follows that a^ < y^. Finally, we note that a^ * Y3 and that in ( 28 ) 

ir^ decreases monotonically as and 82 increase. Consequently, the 

above bounds suffice to obtain the bound 7 t_. 

6 

6. Example. To illustrate the results presented in the preceding 
sections, we consider a process with the following state space description: 


x(k + 1) 


rM2 ^3 


“1 fi 

x(k) + V 

«iJ 1 ° 


y(k + 1) = [1 0]x(k + 1) + v(k + 1) 
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The nuiaerlcal values for a loodel are specified a& a^^ 

and a^^^ ■ -1/4. The modeled noise variances of the zero mean white 

noises are 

’m ■ ■ 5 

The parameters a^ and a^^ are assumed to be correct and the modeling 
error bounds for the parameters a^^^ , and r^^ are given by 

|Aa^| < 0.1 , |Aq| < 1 , and |Ar| < 0.5 

For the numerical values specified. Theorems 3 to 6 yield the 
following bounds: 

Theorem 3: 

Case 1 '^3 “ 3.567 (3.519) 

Case 2 1:3 = 3.748 (3.644) 

Theorem 4: 

Case 1 ir^ = 2.206 (3.295) 

Case 2 tt^ = 2.432 (3.171) 

Theorem 5; = 4.246 (3.450) 

Theorem 6: ^’0 = 2.506 (3.390) 

For con?>arison, the least upper bounds for tr(P) and the greatest 
lower bounds for tr(Pp) are given in parentheses. Cases 1 and 2 for 
Theorems 3 and 4 denote the following: 

Case 1: jAqj < 1 , Ar = 0 

Case 2: jAr] <0.5, Aq = 0 
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7. Conalusions. The performance of Kalman-type, stationary, linear, 
discr«te-time filters in the presence of modeling errors has been analyzed. 
The mean-square error of the estimates was used as clie performance mea- 
sure; modeling errors in the system configuration (AA and AH) and in the 
noise covariances (AQ and AR) were considered. Upper bounds for che per- 
formance measure of subop tlmal filters with modeling errors are given In 
Theorems 1, 3, and 5 and lower bounds for the optimal filters without 
errors are given In Theorems 2, 4, and 6. The bounds In Theorems 3 to 6 
require knowledge of only the model matrices and the range of errors of 
these matrices. Consequently, these bounds are useful in practice, as a 
designer often has Information on the range of modeling errors rather 
than on the exact values of the error matrices. 

It has been iiiq>llcltly assumed In the derivation of the bounds that 
stability of the system and the filter Is preserved In the presence of 
modeling errors AA and AH and that sign definiteness of Q and R Is 
preserved In the presence of modeling errors AQ and AR. It should also 
be noted that the bounds obtained in this paper may be conservative for 
some systems, e.g. , those with very small stability margins, and It may 
be desirable to obtain tighter bounds for specific cases. 
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8. Appendix: Inequalities for Positive Semidefinit^ Matriaes. Some 


of the Inequalities used In the proofs of the theorems presented in n 
text are proved below. 

Lemma 1 : (1) If A > 0 and B > 0, then 

tr(AB) < i|A|| tr(B) < tr(A)tr(B) 
s 

(11) If A > 0 and B > 0, then 

^ >- fgf 

Proof of Lemma 1 : (1) Since A > 0 and B > 0, A^/^ and B^/^ 

I 

exist; hence 

tr(AB) = trCA^/^B^/^Bl/^Al/^) 


= ||a1/2b1/2||2 < ||a1/2||2|,b 1/2||2 = || A I| ^tr(B) 

and since A > 0, ]| A 1| ^ < tr(A), thereby completing the proof. 
(11) To show the second iiife^uallty , we write 

II A II tr(A-lB) . ||A‘/2P tr(A-l/2Bl/2Bl/2A-l/2) 

s s 

. ||Al/2p||A->/2Bl/2||2 2 || B1/2 || ^ - tr(B) 
s 


and since A > 0, tr(A) > ||aI|^, thereby completing the proof. 
Lemma 2: If A > 0 and (|> is an n x n matrix. 


tr(A~^<|)A(|>'^) > ^|A^(<|))P > [tr((j))]' 


Proof of Lemma 2 ; 

tr(A”^():A<j)^) = tr (A“^/^<|)A^/^A‘/^(j)^A“^/^) 

« ||A-^/2^Al/2]i2 

n 

Since il W II 2 > ^ I A . (W) | 2 for any n x n matrix W and 
1=1 ^ 

A^(A“^/2,|,a1/2) s X^(<()), 
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tr(A-HA<|)'^) > 2 lX.C«fr)|2 

i-1 

Using Cauchy's inequality [9, p. 42], we have 




The result of the lemma then follows since 


i*l 


* J ^ 


i«l 


Lemma 3 ; If A > 0 is partitioned as 


[a. A,n 

[As A,J- 


where Aj^ > 0, then 


IIMIJ|a,||^> IIA3 

Proof of Lemma 3 ; Since A^^ exists: 

0 


s 




o1|Ai 

^Jb 


A2 " A3Aj^Aj 




T 

Therefore, A > 0 implies that Aj -■ AjA^^Aj 2 0. Using Weyl'j 
Ity for eigenvalues [4, p. 157], we have 

max XCA^) - max X(AjA“^Aj^) > min XCAj - AjA^^Aj^) i. 0 


II A^ I! g > max XCAjAj^Ag'^) 

■ max X(A"^/^Ag'*'AgA”^/^) 

= !|A7l/2A3Vr/^Il3 


inequal- 
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Therefore, 



thereby completing the proof. 
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